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Abstract — In recent years, the Internet into the Web2.0 era 
has brought the rapid development of social networks. But the 
huge user groups make it becoming increasingly difficult for the 
user to find like-minded potential friends. Therefore, almost all 
of the social networks provide recommended friend function 
based on the FOF algorithm, that is to say, friends of a friend 
will be recommended to the user. This approach mainly focuses 
on the relationship between users, but ignores the effect on 
which the user attributes will have on the formation for a new 
relationship between friends. Based on this, this paper proposes 
a new friend recommendation algorithm LWSNT (Local Walk 
based Social Network Trust), a limited steps walking algorithm 
based on the social network trust. The algorithm quantifies the 
user's attributes and serves as a reference index for the 
recommendation of friends, and introduces the trust mechanism 
based on the social network while taking into account the users’ 
existing relationships, which makes the user more receptive to 
the recommendation result. Experiments show that the 
algorithm proposed in this paper is better than some similar 
algorithms if the target user's good friend information is 
enough. 

Index Terms — Friend Recommendation; Social Network; 
User’s Attributes; Trust. 


I. Introduction 

Social networks are generally based on the six-degree 
segmentation theory [1], a user expands his communication 
circle by making new friends. In recent years, the Internet into 
the Web 2.0 era has brought the rapid development of social 
networking services, for example, Facebook, Sina microblog 
,QQ and other social networking platforms have millions or 
even hundreds of millions of large number of user groups, it 
becomes more and more difficult for users to find like-minded 
potential friends. Under this context, information 
recommendation based on social networking not only 
attracted great attention of researchers, but also have been 
widely used in social and online knowledge sharing sites. 
Compared with the traditional recommendation system, some 
of the recommendations based on social networks introduced 
a trust mechanism. Previous surveys from reference [2] have 
suggested that users prefer recommendations from friends 
rather than from the system. The studies in reference [3], [4] 
have confirmed the positive correlation between user interest 
similarity and user trust, you can increase the social 
networking site traffic and improve user dependency by 
providing users with satisfactory personalized service [5]. 
Therefore, to build a user's social trust network, and then to 
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implement personalized friend recommended based on it will 
create great research value. 

II. Related Works 

Jilin Chen, a researcher at GroupLens, explored the 
relationship between social interest and personal interest in 
information flow recommendations [6]. Le Yu proposed an 
adaptive social similarity calculation method based on matrix 
decomposition, which has been verified by Epinions [7]. Yu 
Haiqun and his collegue proposed a method based on user 
preference for social network recommendation [8]. Yuan T, 
Cheng J, Zhang X, et al. proposed a unified framework to 
properly incorporate the influence of social relationships into 
recommendation[9] . 

Chang WL, Diaz AN, Hung PCK mentioned that the trust 
value estimated will serve as a metric for filtering and sorting 
content of any kind based on the trustworthiness of the 
creator [10].Golbeck defines trust as: If user A determines that 
the behavior of user B will bring good results, then A will trust 
B. The rating of trust is generally measured by the degree of 
trust [11]. The similarity metric can be used as the calculation 
method of the trust between users in the social trust network. 
The method of local trust measurement in social network can 
be divided into the method based on node similarity and the 
calculation method of trust based on path. 

The method based on node similarity takes the common 
neighbors or the node degree at both ends as the consideration. 
The similarity index of the common neighbors is CN(the 
number of common neighbors) [1 1], as shown in Equation (1), 
and the Jaccard metric [12], as shown in Equation (2). The 
premise of these two indicators is that if the number of 
common neighbors between two nodes is larger, the similarity 
between them will be greater. 

Common-Neighbors : 

- IpGi) n p(v)\ (1) 


Jaccard index: 
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The trustworthiness calculation method based on the path is 
mainly directed to the indirect trust users. The TidalTrust 
algorithm [15] is a kind of calculation method based on 
breadth first search proposed by Golbeck. Bouraga S, Jureta I, 
Faulkner S utilized a trust inference based algorithm to 
measure reputation score of each individual service, and 
subsequently trustworthiness of their composition[13].The 
idea of LRW (Local Random Walk) [14] can be expressed as 
follows: Given a graph and a starting point, the walker starts 
at random and moves to any adjacent node of the starting 
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point. Adjacent nodes serve as new starting points. And the 
process will repeat and get a node sequence which is a random 
walk process. However, for an increasingly large social 
network, the global random walk has a large amount of 
computation which is almost computationally infeasible. 
Therefore, according to the six-dimensional theory [1], the 
LRW Friend algorithm only travels limited number of steps, 
that is, only consider the limited length of the path, generaly 2 
to 6. The algorithm terminates in a relatively stable state 
without pursuing a steady-state distribution. 

The Common-Neighbors and Jaccard metrics mentioned 
above are more concerned with the number of friends in 
common, and LRW Friend algorithm uses only the 
relationships among the points in the social graph, none of 
which used the attributes of the user to optimize the 
recommendation results. In this paper, we propose a new 
friend recommendation algorithm, LWSNT (Local Walking 
Based Social Network Trust), which combines the social 
graphs with user attributes to calculate the trustworthiness of 
social network users. The advantages of the algorithm are as 
follows: 

(1) to quantify the user's attributes and use it as a 
reference index for friends' recommendation; 

(2) to introduce the trust mechanism based on the social 
network diagram, and make the users more likely to 
accept the recommendation results; 

(3) to get a better recommendation than the relevant 
algorithm in the case of sufficient user information, 
especially in the TOP-N recommendation can be 
achieved quite good results. 

III. The Proposed Approach 
A. Problem modeling 

The relationship between users in a social network can be 
represented as a social graph of vertex sets and vertex pairs, 
where the vertices represent the users and the edges represent 
the social relationships between the users. In the social 
networks like Facebook, Renren and QQ, once A and B are 
connected, A is the friend of B and B is the friend of A, that is, 
the relationship between users is equal. The edge of the social 
graph is undirected, and it composing a undirected graph 
called the relational map [15]. 

Definition 1: According to graph theory, the social graph 
can be defined as G = (¥ f E} , where vertex set ¥ 
represents all users in the social network, and 
non-directed edge set E represents the relationship pair 
existed in the social network. When there is a good friend 
relationship between the and Vj, Vj is considered the 
adjacency point of vt .The social graph G can be 
represented by the adjacency matrix A = (a. Lj U E E . If 
there is a relationship between users, then = 1, 
otherwise a- L j = 0. This paper uses the form of adjacency 
list to store the social graph G. 

In this paper, A user other than a friend of the target user in the 
social graph is referred to as a recommended user. The social 
map is shown in Fig. 1 : 



Suppose in the social network, the user has c attributes [16], 
notated with ( x v x 2 > mmm t x if — , x € , i E 1 c), such as sex, 

age, home, place of residence, hobbies, occupation, label, age 
of social accounts (from the date of registration) and their 
mutual friends, etc. Where interests can be subdivided, and 
each attribute has multiple values. These attributes may be 
independent or may have some relationship between each 
other. For example, a young male has a strong interest in 
sports, which identifies that age and gender will have a 
greater impact on a person's interest. If the interactions 
between attributes are considered, the problem will become 
more complicated, and it will get more and more difficult to 
calculate the degree of trust. Therefore, this paper will 
calculate the trust between the social network nodes based on 
independent users’ attributes. 

B. Trust Calculation 

For convenience of description, the following definitions are 
given. 

Definition 2: The target user’s existing friend set is F(V), 
size is iwi. 

Definition 3: The non-buddy nodes of the target user in 
the social graph form the recommended set named as the 
candidate set C{V), and the size is C (v) I. 

Definition 4: The recommended list generated by the 
training set is JI(p) , size is R(v’)l , which is the 
recommended result of the algorithm. 

Definition 5: The user’s real behavior list is T (v), size is 

lrtv)l. 

IV. User attributes quantification 

The degree of influence of the i-th attribute on the trust degree 
of the target user is called "single attribute trust degree", 
represented by yi . 

There may be multiple values for a user attribute x Taking 
gender as an example, assume that sex is the first attribute. 
Target user A has 55 male friends, 45 female friends, total 
friends lF(i?)| =45+55=100. For a recommended male user B 
in the social graph, y t (x i = male) = 55/100 = 0.55. The 
calculation of a single-attribute trustworthiness is shown in 
Equation (3): 

yi{x; = values') = s { (x i: = values} f\F [v)\ (3) 

Where s-^Xi = values) is the number of occurrences of the 
i-th attribute x { = values for all the friends of the target user. 
As can be seen from the formula 3, y\ is only related to the 
attributes of user's friends. 

Let be the trust degree of the target user to the id-th 
recommended user. Equation (4) is the calculation formula of 

v 
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^ = 2!(/ X y;(*! = values'}') (4) 


In the above formula, f is introduced to represent the average 
influence of each attribute, the initial value of / = l/c. 
Considering that a certain attribute will have a greater impact 
on the recommended results, a constant Z[ is used to enlarge 
the f value of the i-th attribute, and the sum of the influence of 
all attributes should always be l,as shown in Equation( 5). 

SiOi x /) = l (5) 

So the new f value is calculated as shown in Equation (6). 

Vij», 

Combining the above four formulas, the formula of trust 
degree will be defined as. 

Yid ~ S?(z E x xs i (x i = values )/|f(i0l) (7) 


V. The effect of closely relation on trust degree 

In the social map mentioned in this paper, the target user can 
reach the adjacent node who is his friends, the node needed 2 
steps to reach is a friend of friends, and the node taken 3 steps 
to reach is a friend of a friend's friend. In real life, the trust 
between users decreases with the number of walking steps. 
According to the six-dimensional theory, the number of 

walking steps k ranges from 2 to 6. In this paper, we introduce 

k - 1 

a trust attenuation factor Of- = — , where t is the number of 

walking steps. The attenuation factor is combined with 
Equation 7 to obtain the final calculation formula (8). In this 
paper, the number of walking steps k = 4. 

Yid - x SiC^e x X Si(Xi = valves') f\F(v}\) (8) 
A. Algorithm Description 

Combining the social graph and the new trust calculation 
method introduced above, this paper proposed a new friend 
recommendation algorithm. 

The pseudocode of the friend recommendation algorithm 
LWSNT is shown in Algorithm 1. In step 3, the breadth-first 
search method is used to traverse the social graph and each 
point is traversed only once. Note that in the calculation of the 
trust degree V^, the result is related to z j ,y,- and the current 
step number t, and only one parameter t is related to the node 
traversal. The breadth-first search itself can be regarded as a 
level order traversal, The number of access steps t to the 
current node must be the smallest of all traversal methods, 
while the degree of trust obtained is the largest of the current 
node. The purpose of using the heap is to reduce the time 
complexity of step 4. In the calculation process, The number 
of friends of the target user is IfooI, the number of nodes is 
n, the number of edges is m, and the number of edges 
traversed by the algorithm is m v . The time complexity of Step 
2 is 0(c x |F(i?)l). The time complexity of steps 3 and 4 is 
0(m r X HlognN') . The time complexity of Step 5 is 
N ) . From the previous analysis, the time complexity 


of step 2 and step 5 is relatively small, so the overall time 
complexity of the algorithm is determined by the time 
complexity of steps 3 and 4, that’s to say, 0{m v x Ki!d£ 2 A r ). 


Algorithm 1 LWSNT 

Inputs: 

target user v a , recommendation set size N 
(recommendation N friends to the target user), user 
information data set (including attribute information and 
friend relationship information and C(u a )). And z^ 

(may not input, the default value is 1) 

Output: 

Candidate recommendation set consisting of Top-N 
recommended users for user v c . 

Step 1: 

Create a social graph based on existing relationships. 

Step 2: 

Use the user attribute information of F(v a ) to calculate the 
single attribute trust degree y ? i of user attribute x using 
Equation(3). 

Step 3: 

Walking from v a , using the breadth-first search method 
traverses the social graph. For the visited node v E C(v a ), 
the tag v has been accessed, the number of walking steps 
is recorded and Equation(8) is used to calculate the trust 
degree Each node only traverses once, that is, Y id is 
calculated only once. 

Step 4: 

Create a small top heap with the size N and the heap with 
the key pair < Jd >. According to the properities of the 
small top heap, if the new Y [d is larger than the heap key 
value of the trust V^, then pop-up the top elements of the 
heap and add a new key-value pair toensure that the Y [d of 
the elements in the heap is the current maximum N numbers. 
Step 5: 

The elements in the heap are the id and Y ^ of the 
recommended set R(u a 3. The top of the heap elements will 
be deposited into a structure array in turn, and then output 
the elements in the array in reverse order. 


VI. Experiments and Analysis 

A. Data Set and Preprocessing 

To test the algorithm, this paper got data from Sina 
microblogging (www.weibo.com) and stored in the database. 
Sina microblogging, as the most active user microblogging 
platform, provides a complete set of open source interface, 
users can call the API directly to get data from the Sina 
servers. For the protection of user privacy, this paper only 
uses the public information of microblogging users. First of 
all, it selected 60 users as the target user, called V L , 
downloaded their user information, and then downloaded a 
total of 4896 users' information from the 60 users’ friends, 
known as V 2 . Finally, it downloaded the user information of 
V 2 friends’, known as F 3 , a total of 363,458 users. The data 
set is preprocessed, for every target user, random select 10% 
of their friends as the test set of the experiment, the remaining 
90% as a friend set of the experiment. 

B. Experimental measurement methods 

A friend recommendation is a TOP-N prediction that 
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focuses on whether a user will see the recommended content. 
This prediction generally provides users with a personalized 
recommendation list, using the prediction accuracy and recall 
rate to measure the recommended accuracy rate [20] . 

Recall is calculated as shown in Equation (9): 

Recall = (9) 

Eirti'lrpol v ' 


The precision is calculated as shown in Equation (10): 


Precision 


(rim- (in | 


( 10 ) 


Recommended accuracy The FI value is calculated as 
shown in Equation (11): 


FI = 


2 >:P isiCrYl >:P.££-1 ! L 
Prsc izhn +££££[[ 


( 11 ) 


Obviously, the larger the FI Measure, the better the 
performance of the algorithm. 

C. Experimental results and analysis 

The experiment selected the recommended set size 
N= 1,2, 5, 10 and CN and Jaccard are compared with the 
LWSNT algorithm in this paper. Experiments were 
performed based on pre-processed data sets. There are 60 
target users in the dataset, 14 of which have less than 50 
friends and the remaining 46 have more than 50 friends. The 
experimental results are shown in Table. I and Table. II. 


Table. I the FI value of three kinds algorithms when the number of 
target users friends <50 



N = 1 

N = 2 

N = 5 

N= 10 

LWSNT 

0.3460 

0.1623 

0.1000 

0.0629 

CN 

0.4068 

0.1677 

0.1061 

0.0643 

Jaccard 

0.1086 

0.0620 

0.0482 

0.0327 


Table. II the FI value of three kinds algorithms when the number of 
target users friends >50 



N = 1 

N = 2 

N = 5 

N = 10 

LWSNT 

0.8123 

0.5171 

0.3846 

0.2353 

CN 

0.6429 

0.5080 

0.3820 

0.2474 

Jaccard 

0.5286 

0.4371 

0.3229 

0.2026 


Table. I shows the performance of the three algorithms when 
the number of target users is small. From the target user 
friends to obtain the effective information is not much of the 
case, the performance of the three algorithms are not very 
good. In contrast, CN's performance is the best, LWSNT and 
its little difference, Jaccard's performance is relatively poor. 
From Table. II and Table. I shows that the more friends of 
target users, the more effective information can obtained from 
friends, the better prediction accuracy of the algorithm. At 
this time, LWSNT has the highest accuracy, and only slightly 
worse than CN at N = 10. CN and Jaccard's performance has 
also been significantly improved. 
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s CN 
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Fig.2 the FI value of the three algorithms without regard to the 
number of target users 

It can be seen from Fig.2 that LWSNT algorithm is slightly 
better than CN, Jarccard's performance is acceptable but is 
much worse than the other two algorithms. 

Through the experimental study, the following conclusions 
can be drawn: 

(1) CN, Jaccard and LWSNT of this paper are affected by the 
effective information. The more the friends of target users, the 
better the performance of the three algorithms. 

(2) LWSNT algorithm in general have good performance but 
the recommended accuracy is higher at N = 1, 2, 5. 


VII. Conclusion 

This paper proposed a new friend recommendation 
algorithm based on social graph, LWSNT. Compared with 
other FOF algorithms, LWSNT utilizes the attributes of users 
to improve the accuracy of prediction. From the experimental 
results, the LWSNT algorithm presented in this paper is 
superior to the CN and the overall performance of Jaccard. 
This paper also finds that the performance of friend 
recommendation algorithm based on social graph is affected 
by the number of existing friends of the target user, and the 
more the number of existing friends is, the better the 
performance of algorithm is. In the case of a small number of 
friends, LWSNT algorithm performance is also acceptable. 
Of course, this study also received some restrictions. First of 
all, in the user attribute quantization, is assumed in a variety of 
properties completely independent of the case , the next study 
can consider the relationship between attributes. Second, the 
data set used in this experiment need to be extended. Finally, 
the time complexity of this algorithm is higher than that of CN 
and Jaccard, and the next research will focus on time 
efficiency optimization of the algorithm. 
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